DETAILED ACTION 
Response to Amendment 

1. In response to the Office action of 3/8/2004, the applicant has submitted 
an amendment, filed 6/8/2004, amending claims 1 and 29 to include "acoustic 
input signal" and "predetermined interference models" (Amendment pages 9-10). 
Applicant's arguments have been considered but are moot in view of the new 
grounds of rejection, necessitated by the amended claims. 

2. Claims 22-28 are rejected over previous grounds of rejection in view of 
Mizuno et al. (US Patent No. 6226614). 

Claim Rejections - 35 USC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 
U.S.C. 102 that form the basis for the rejections under this section made in this 
Office action: 

A person shall be entitled to a patent unless - (e) the invention was described in (1) an 
application for patent, published under section 122(b), by another filed in the United States 
before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, 
except that an international application filed under the treaty defined in section 351(a) shall 
have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under 
Article 21(2) of such treaty in the English language. 

4. Claims 22-28 are rejected under 35 U.S.C. 102(e) as being anticipated by 
Mizuno et al. (US Patent No. 6226614). 

5. Regarding claim 22, Mizuno et al. disclose a method for modifying one or 
more speech synthesizer run-time control parameters, the method comprising 
the steps of: receiving the real-time data (col. 8, ln. 53 to col. 9, ln. 19); 
identifying relevant characteristics of the speech based on the real-time data, 
the relevant characteristics having corresponding run-time control parameters 
(col. 9, ln. 20-34); and applying adjustment values to parameter values of the 
control parameters such that the relevant characteristics of the speech change 
in a desired fashion (col. 9, ln. 20-34). 



6. Regarding claims 23-28, Mizuno et al. further disclose the step of 
changing relevant speaker characteristics of the speech (col. 7, ln. 38-67 and 
col. 9, ln. 20-54), changing relevant voice characteristics of the speech (col. 7, 
ln. 38-67 and col. 9, ln. 20-54), changing relevant speaking style characteristics 
of the speech (col. 7, ln. 38-67 and col. 9, ln. 20-54, angry or glad voices are 
different style characteristics), changing relevant emotion characteristics of the 
speech (col. 7, ln. 38-67 and col. 9, ln. 20-54), changing relevant dialect 
characteristics of the speech (col. 7, ln. 38-67 and col. 9, ln. 20-54, changing the 
pitch information can produce a different accent and consequently yields dialect 
characteristics), and changing relevant content characteristics of the speech 
(col. 7, ln. 38-67 and col. 9, ln. 20-54). 



Claim Rejections - 35 USC § 103 
Art Unit: 2655 

7. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 102 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 



8. Claims 1, 7-18 and 20 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Mizuno et al. (US Patent No. 6226614) in view of Spies (US Patent No. 
6035273). 

9. Regarding claim 1, Mizuno et al. disclose a method for modifying 
synthesized speech, the method including the steps of: generating synthesized 
speech based on textual input and a plurality of run-time control parameter 
values (col. 8, ln. 16-52 or referring to figures 2 and 3); and modifying one or 
more of the run-time control parameter values based on the real-time data such 
that the intelligibility of the speech increases (col. 8, ln. 53 to col. 9, ln. 54 or 
referring to figure 3). 

Mizuno et al. fail to disclose the step of generating real-time data based on 
an acoustic input signal, the acoustic input signal characterizing an intelligibility of 
the speech with regard to a listener. However, Spies teaches the step of 
generating real-time data based on an acoustic input signal, the acoustic input 
signal characterizing an intelligibility of the speech with regard to a listener (col. 
3, ln. 24-67, the text and six coefficients characterize the speech intelligibility). 

Since Mizuno et al. and Spies are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in 
the art at the time of invention to modify Mizuno et al. by incorporating the teaching 
of Spies in order to make the synthesized speech sound like the speaker. 

10. Regarding claim 7, Mizuno et al. further disclose the steps of: receiving 
the real-time data (col. 8, ln. 53 to col. 9, ln. 19); identifying relevant 
characteristics of the speech based on the real-time data, the relevant 
characteristics having corresponding run-time control parameters (col. 9, 
ln. 20-34); and applying adjustment values to parameter values of the control 
parameters such that the relevant characteristics of the speech change in a 
desired fashion (col. 9, ln. 20-34). 



11. Regarding claims 8-9, Mizuno et al. further disclose the step of changing 
relevant speaker characteristics of the speech (col. 7, ln. 38-67 and col. 9, 
ln. 20-54), and changing relevant voice characteristics of the speech (col. 7, 
ln. 38-67 and col. 9, ln. 20-54). 



12. Regarding claim 10, Mizuno et al. further disclose that the step of 
changing characteristics is selected from the group consisting essentially of: 
speech rate, pitch, volume, parametric equalization, formant frequencies and 
bandwidths, glottal sources, speech power spectrum tilt, gender, age, and 
identity (col. 7, ln. 38-67 and col. 9, ln. 20-54, these are regarded as prosody 
information). 



13. Regarding claims 11-18 and 20, Mizuno et al. further disclose the step of 
changing relevant speaking style characteristics of the speech (col. 7, ln. 38-67 
and col. 9, ln. 20-54, angry or glad voices are different style characteristics), 
changing characteristics selected from the group consisting essentially of: 
dynamic prosody; and articulation (col. 7, ln. 38-67 and col. 9, ln. 20-54, 
changing the pitch of words can yield articulating speech), changing relevant 
emotion characteristics of the speech (col. 7, ln. 38-67 and col. 9, ln. 20-54), 
changing an urgency characteristic of the speech (col. 7, ln. 38-67 and col. 9, 
ln. 20-54, is considered as prosody information), changing relevant dialect 
characteristics of the speech (col. 7, ln. 38-67 and col. 9, ln. 20-54, changing the 
pitch information can produce a different accent and consequently yields dialect 
characteristics), changing characteristics selected from the group consisting 
essentially of: pronunciation; and articulation (col. 7, ln. 38-67 and col. 9, 
ln. 20-54, prosody information), changing relevant content characteristics of the 
speech (col. 7, ln. 38-67 and col. 9, ln. 20-54), changing characteristics selected 
from the group consisting essentially of: repetition; redundancy; and vocabulary 
(col. 7, ln. 38-67 and col. 9, ln. 20-54, that is the intention characteristics of the 
speech), and generating the real-time data based on listener input (col. 8, 
ln. 26-39). 



Claim Rejections - 35 USC § 103 

14. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 102 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

15. Claims 19 and 21 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Mizuno et al. (US Patent No. 6226614), in view of Spies (US 
Patent No. 6035273), as applied to claim 1 above, in view of Logan et al. (US 
Patent No. 6199076). 



16. Regarding claim 19, Mizuno et al. fail to disclose the method of claim 1 
further includes the step of using polyphonic audio processing to spatially 
reposition the speech based on the real-time data. However, Logan et al. teach 
the step of using polyphonic audio processing to spatially reposition the speech 
based on the real-time data (col. 5, ln. 13-25). The advantage of using the 
teaching of Logan et al. in Mizuno et al. is to play several sounds at the same 
time. 

Since Mizuno et al. and Logan et al. are analogous art because they are 
from the same field of endeavors, it would have been obvious to one of ordinary 
skill in the art at the time the invention was made to modify Mizuno et al. by 
incorporating the teaching of Logan et al. in order to play several sounds at the 
same time. 

17. Regarding claim 21, Mizuno et al. fail to disclose the method of claim 1 
further includes the step of using the synthesized speech in an automotive 
application. However, Logan et al. teach the step of using the synthesized 
speech in an automotive application (col. 36, ln. 37-47). The advantage of using 
the teaching of Logan et al. in Mizuno et al. is to provide an audible response to 
the driver to minimize driving distraction. 

Since Mizuno et al. and Logan et al. are analogous art because they are 
from the same field of endeavors, it would have been obvious to one of ordinary 
skill in the art at the time the invention was made to modify Mizuno et al. by 
incorporating the teaching of Logan et al. in order to provide an audible response to 
the driver to minimize driving distraction. 

18. Claim 2 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Mizuno et al. (US Patent No. 6226614), in view of Spies (US Patent No. 
6035273), as applied to claim 1 above, in view of Graciotti et al. (US Patent No. 
4903302). 

19. Regarding claim 2, Mizuno et al. disclose the step of generating the 
real-time data (col. 8, ln. 26-39), but fail to specifically disclose that generating 
the real-time data is based on background noise contained in an environment in 
which the speech is reproduced. However, Graciotti et al. teach that generating 
the real-time data is based on background noise contained in an environment in 
which the speech is reproduced (col. 4, ln. 16-26). The advantage of using the 
teaching of Graciotti et al. in Mizuno et al. is to produce a suitable speech signal 
level to increase intelligibility for listeners. 

Since Mizuno et al. and Graciotti et al. are analogous art because they are 
from the same field of endeavors, it would have been obvious to one of ordinary 
skill in the art at the time the invention was made to modify Mizuno et al. by 
incorporating the teaching of Graciotti et al. in order to produce a suitable speech 
signal level to increase intelligibility for listeners. 



20. Claims 29-30 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Mizuno et al. (US Patent No. 6226614), in view of Goldberg et al. (US 
Patent No. 5970446). 



21. Regarding claim 29, Mizuno et al. disclose a speech synthesizer 
adaptation system comprising: a text-to-speech synthesizer for generating 
speech based on textual input and a plurality of run-time control parameter 
values (col. 8, ln. 16-52 or referring to figures 2 and 3) and an adaptation 
controller operatively coupled to the synthesizer and the audio input system, the 
adaptation controller for modifying one or more of the run-time control parameter 
values (col. 17, ln. 55-67 or referring to element 17 of figure 12), and the 
adaptation controller for modifying one or more of the run-time control parameter 
values based on the real-time data such that interference between the 
background noise and the speech is reduced (col. 9, ln. 55 to col. 10, ln. 32). 

Mizuno et al. fail to specifically disclose an audio input system for 
generating real-time data based on background noise contained in an 
environment in which the speech is reproduced. However, Goldberg et al. teach 
an audio input system for generating real-time data based on background noise 
contained in an environment in which the speech is reproduced (col. 4, ln. 51 to 
col. 5, ln. 67). 

Since Mizuno et al. and Goldberg et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of 
ordinary skill in the art at the time the invention was made to modify Mizuno et al. 
by incorporating the teaching of Goldberg et al. in order to reduce noise from the 
input signal to enhance speech recognition accuracy. 

22. Regarding claim 30, the modified Mizuno et al. fail to specifically 
disclose that the audio input system includes an acoustic-to-electric signal 
converter. However, it would have been obvious to one of ordinary skill in the art 
that any microphone has an acoustic-to-electric signal converter in order to 
acquire a speech signal from users. 



23. Claims 3-5 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Mizuno et al. (US Patent No. 6226614) in view of Spies (US 
Patent No. 6035273), in view of Graciotti et al. (US Patent No. 4903302), as 
applied to claim 2 above, and further in view of Goldberg et al. (US Patent No. 
5970446). 

24. Regarding claim 3, the modified Mizuno et al. fail to disclose the method of 
claim 2 further includes the steps of: converting the background noise into an 
electrical signal; retrieving one or more interference models from a model 
database; and characterizing the background noise with the real-time data based 
on the electrical signal and the interference models. 

However, Goldberg et al. teach converting the background noise into an 
electrical signal (col. 4, ln. 5-7, "recording" indicates the conversion of a sound 
signal into an electrical signal); retrieving one or more interference models from a 
model database (col. 3, ln. 54-64); and characterizing the background noise with 
the real-time data based on the electrical signal and the interference models (col. 
4, ln. 1-67). The advantage of using the teaching of Goldberg et al. in the 
modified Mizuno et al. is to reduce noisy background signal to enhance the 
recognition accuracy. 

Since the modified Mizuno et al. and Goldberg et al. are analogous art 
because they are from the same field of endeavors, it would have been obvious 
to one of ordinary skill in the art at the time the invention was made to further 
modify Mizuno et al. by incorporating the teaching of Goldberg et al. in order to 
reduce noisy background signal to enhance the recognition accuracy. 

25. Regarding claims 4-5, the modified Mizuno et al. fail to specifically 
disclose the method of claim 3 further includes the step of performing a time 
domain and frequency domain analysis on the electrical signal. However, 
Goldberg et al. further teach different methods of analysis may be used on the 
electrical signal (col. 2, ln. 44-47). It would have been obvious to one of ordinary 
skill in the art that the noise signal can be analyzed by using the time-domain or 
frequency-domain analysis techniques in order to study characteristics of the 
recorded noise. 



26. Claim 6 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Mizuno et al. (US Patent No. 6226614), in view of Spies (US Patent No. 
6035273), in view of Graciotti et al. (US Patent No. 4903302), in view of 
Goldberg et al. (US Patent No. 5970446), as applied to claim 3 above, and 
further in view of Lazar (US Patent No. 5818389). 



27. Regarding claim 6, the modified Mizuno et al. fail to specifically disclose 
the method of claim 3 wherein the characterizing step is selected from the group 
consisting essentially of the steps of: identifying high level interference in the 
background noise; identifying low level interference in the background noise; 
identifying momentary interference in the background noise; identifying 
continuous interference in the background noise; identifying varying interference 
in the background noise; identifying stationary interference in the background 
noise; identifying spatial locations of sources of the background noise; identifying 
potential sources of the background noise; and identifying speech in the 
background noise. 

However, Lazar teaches identifying high level interference in the 
background noise (col. 9, ln. 32-37); identifying low level interference in the 
background noise (col. 10, ln. 35-39); identifying momentary interference in the 
background noise (col. 3, ln. 29-38, varying interference can be considered as 
momentary interference); identifying varying interference in the background noise 
(col. 3, ln. 29-38); identifying spatial locations of sources of the background noise 
(col. 10, ln. 6-9); identifying potential sources of the background noise (col. 5, 
ln. 30-33 or col. 6, ln. 46-52); and identifying speech in the background noise 
(col. 9, ln. 1-15). The advantage of using the teaching of Lazar in the modified 
Mizuno et al. is to identify types of interference so that the system can take 
appropriate action to produce intelligible sound to listeners. 

Since the modified Mizuno et al. and Lazar are analogous art because 
they are from the same field of endeavors, it would have been obvious to one of 
ordinary skill in the art at the time the invention was made to further modify 
Mizuno et al. by incorporating the teaching of Lazar in order to identify types of 
interference so that the system can take appropriate action to produce intelligible 
sound to listeners. 

The modified Mizuno et al. still fail to specifically disclose identifying 
continuous and stationary interferences in the background noise. However, it 
would have been obvious to one of ordinary skill in the art that if the detected 
interference is not varying, then the interference is continuous and 
stationary. 



Application/Control Number: 09/800,925 

Conclusion 

Applicant's amendment necessitated the new ground(s) of rejection 
presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. 
See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as 
set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1.136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Huyen Vo whose telephone number is 
703-305-8665. The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Doris To, can be reached on 703-305-4827. The fax 
phone number for the organization where this application or proceeding is 
assigned is 703-872-9306. 




Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see 
http://pair-direct.uspto.gov. Should you have questions on access to the Private 
PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 
(toll-free). 

Examiner HuyenX Vo August 16, 2003 





